Agent Skills: Automatic Speech Recognition (ASR)
Transcribe audio segments to text using Whisper models. Use larger models (small, base, medium, large-v3) for better accuracy, or faster-whisper for optimized performance. Always align transcription timestamps with diarization segments for accurate speaker-labeled subtitles.
UncategorizedID: benchflow-ai/skillsbench/Automatic Speech Recognition (ASR)
278174
Install this agent skill to your local
Skill Files
Browse the full folder contents for Automatic Speech Recognition (ASR).
Loading file tree…
Select a file to preview its contents.